Comparing Fuzzy, Probabilistic, and Possibilistic Partitions Using the Earth Mover's Distance

نویسندگان

  • Derek Anderson
  • Alina Zare
  • Stanton R. Price
چکیده

A number of noteworthy techniques have been put forth recently in different research fields for comparing clusterings. Herein, we introduce a new method for comparing soft (fuzzy, probabilistic and possibilistic) partitions based on the earth mover’s distance (EMD) and the ordered weighted average (OWA). The proposed method is a metric, depending on the ground distance, for all but possibilistic partitions. It is extremely flexible due to its EMD formulation, OWA aggregation and abstract concept of ground distance. In theory, our method is agnostic to the type (uncertainty) of soft partition, clustering algorithm, distance measure used in the clustering algorithm(s) and it is applicable to the clustering of both object and relational data. Validation is performed theoretically, experimentally and also in terms of computational complexity. Emphasis is placed on the set of possibilistic partitions, specifically noise and co-incident clusters, important cases that have received little-to-no attention to date in the comparing clusterings literature. Improvements are reported in terms of metric properties and computational complexity over existing extended concordance / discordance (e.g., soft Rand and Jaccard) approaches and improved design and robustness in comparison to existing transportation problem based approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Concordance indices for comparing fuzzy, possibilistic, rough and grey partitions

Many indices have been proposed in literature for the comparison of two crisp data partitions, as resulting from two different classifications attempts, two different clustering solutions or the comparison of a predicted vs. a true labelling. Crisp partitions however cannot model ambiguity, vagueness or uncertainty in class definition and thus are not suitable to model all cases where informati...

متن کامل

A possibilistic approach to risk aversion

Possibility theory was initiated by Zadeh in 1978 as an alternative to probability theory. Probability theory is not efficient in the study of those uncertainty situations in which phenomena occur with a small frequency. In such cases it is preferable to apply techniques offered by possibility theory.In foundation of possibility theory one started on a paralel line with probability theory. Rand...

متن کامل

The Earth Mover's Distance is the Mallows Distance: Some Insights from Statistics

The Earth Mover’s distance was first introduced as a purely empirical way to measure texture and color similarities. We show that it has a rigorous probabilistic interpretation and is conceptually equivalent to the Mallows distance on probability distributions. The two distances are exactly the same when applied to probability distributions, but behave differently when applied to unnormalized d...

متن کامل

Tsallis Entropy and Conditional Tsallis Entropy of Fuzzy Partitions

The purpose of this study is to define the concepts of Tsallis entropy and conditional Tsallis entropy of fuzzy partitions and to obtain some results concerning this kind entropy. We show that the Tsallis entropy of fuzzy partitions has the subadditivity and concavity properties. We study this information measure under the refinement and zero mode subset relations. We check the chain rules for ...

متن کامل

Probabilistic-Possibilistic Belief Networks

The interpretation of membership functions of fuzzy sets as statistical likelihood functions leads to a probabilistic-possibilistic hierarchical description of uncertain knowledge. The fundamental advantage of the resulting fuzzy probabilities with respect to imprecise probabilities is the ability of using all the information provided by the data. This paper studies the possibility of using fuz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Fuzzy Systems

دوره 21  شماره 

صفحات  -

تاریخ انتشار 2013